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Abstract 



In this paper we consider the online ftp problem. The goal is to service a 
sequence of file transfer requests given bandwidth constraints of the underly- 
ing communication network. The main result of the paper is a technique that 
leads to algorithms that optimize several natural metrics, such as max-stretch, 
total flow time, max flow time, and total completion time. In particular, we 
show how to achieve optimum total flow time and optimum max-stretch if 
we increase the capacity of the underlying network by a logarithmic factor. 

We show that the resource augmentation is necessary by proving polyno- 
mial lower bounds on the max-stretch and total flow time for the case where 
online and offline algorithms are using same-capacity edges. Moreover, we 
also give poly-logarithmic lower bounds on the resource augmentation fac- 
tor necessary in order to keep the total flow time and max-stretch within a 
constant factor of optimum. 

1 Introduction 

Consider the problem of sending large files (eg. bitmap images) through a general 
topology network. The requests arrive online and the goal is to eventually satisfy 
all the requests. Since the bandwidth of the links in the network is limited, it 
makes sense to try to schedule the transmissions in a way that utilizes the available 
resources optimally. 

In this paper we consider the online ftp problem, which is a formal abstrac- 
tion of the above file transfer problem. We assume that each ftp request specifies 
source/destination nodes and the size of the file. The goal of the online algorithm 
is to choose a path that will be used for transmitting each file, and to decide on the 
transmission rate. The main difference between this model and the (well-studied) 
models for online routing and admission control [11, 1, 12, 2] is that here we do 
not assume that the sources have prespecified transmission rate requirements, i.e. 
we can deal with non-streaming types of information. 

There are two related measures of performance that can be used to compare 
different algorithms for the online ftp problem. The first measure is the total flow 
time, i.e. the sum over all jobs of the time that elapses between the instant the ftp 
request is submitted and the time it is satisfied (including the transmission time). 
The other measure is the max-stretch, which is the maximum over all ratios of 
the flow time of each request and the smallest time needed to satisfy this request. 
The second quantity is determined by the link bandwidth and the size of the file. 
Both measures are useful since they are directly related to the performance of the 
network perceived by the end-user. 
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Let n be the number of requests and P the maximum ratio between the sizes 
of the files. Assume that the smallest request can be processed in one time unit. 
Let F^ AX denote the optimum max-flow i.e. the smallest value for the maximum 
time a request spends in the system. The main results of the paper are algorithms 
that achieve the optimum max-stretch and the optimum total flow time using re- 
source augmentation 1 . For the max-stretch algorithm we need to increase capaci- 
ties by a factor of 0(log P), whereas the total flow time algorithm needs a factor 
of 0(log F^ AX ) increased capacity 2 . The latter algorithm does not only achieve 
optimum the total flow time, but simultaneously optimizes many other objective 
functions, like the maximum flow time, the total square-of-flow-time, etc. 

To justify the need for giving larger capacities to the online algorithm (i.e. 
resource augmentation), we show polynomial lower bounds on both max-stretch 
and total flow time for the case where both online and offline algorithms are us- 
ing the same capacities. Moreover, we show that in order to achieve constant 
competitive ratio against an adaptive adversary we have to give the online algo- 
rithm £2 (log P/ log log P) factor more capacity for the max-stretch metric, and 
Q (y/log y I log log y) more capacity for the total flow metric, where y = min{«, P}. 

In the context of machine scheduling, total flow time is known to be a hard 
metric to approximate [17] and it is only recently that progress has been made 
towards obtaining algorithms that give total flow time guarantees. In particular, 
logarithmic-factor resource augmentation was used in [20] to obtain optimum flow 
time for machine scheduling. Max-stretch was recently proposed as a good metric 
to measure user satisfaction [5]. Our lower bound on the amount of resource aug- 
mentation needed for max-stretch holds in the machine scheduling model as well, 
and therefore our upper bounds for max-stretch are also of interest in the machine 
scheduling model. 

When proving upper bounds, we restrict our algorithms to use a single rate 
when transmitting a specific file, and do not allow preemption. The competitive 
ratio is computed against an offline algorithm that does not have these restrictions. 
Our lower bounds for online flow-time minimization algorithm without resource 
augmentation (i.e. both the online and the offline algorithms work in the same 
network) hold even if we remove this restriction, i.e. allow the algorithm to use 
time-varying rate when transmitting a file. This contrasts with minimizing flow 
time for machine scheduling, where a log P -competitive preemptive algorithm is 
known [18]. Also, the lower bound for total flow time is achieved using same-size 

'Throughout this paper, when we refer to an optimum solution, we mean the optimum without 
any resource augmentation. 

2 Note that F MAX < nP and therefore log F^, AX < logn + log P. 
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files. This is in contrast to machine scheduling where the unit jobs case is trivial. 

We view the online ftp problem as a special case of the set scheduling problem. 
In this problem we have a set of resources and each job requires a specific subset 
of these resources (or one of a set of subsets). Set scheduling is a natural gen- 
eralization of the machine scheduling problem that was extensively studied under 
several different metrics (See [16] for a survey of offline approximation algorithms, 
and [18, 20, 5, 14, 15, 8, 19] for a sampling of recent results in online algorithms.). 
The set scheduling model is similar to the parallel jobs model studied by [10, 22]. 
We show how to apply several techniques developed in the context of machine 
scheduling to the set scheduling problem (and hence the online ftp problem) for 
simpler metrics such as makespan and total completion time. In particular, we use 
the technique that allows us to convert an offline optimization algorithm that max- 
imizes the number of scheduled jobs into an online algorithm that minimizes total 
completion time [14, 15, 19]. We also develop new techniques that help us attack 
more difficult metrics such as total flow time and max-stretch. 

The techniques developed in this paper can be better understood when com- 
pared to the technique of Hall et al. [14, 15]. There the approach is to use of- 
fline p-approximation algorithms for offline packing problems to construct O(p)- 
competitive online algorithms for average completion time. Our techniques allow 
the transformation of offline packing algorithms that achieve the optimum packing 
using O (p) resource augmentation into online algorithms that achieve the optimum 
flow time using 0(p ■ F^ AX ) resource augmentation. If the online algorithm is not 
required to work in polynomial time, then an optimum offline solution (p = 1) 
can be used. Unlike the work of Hall et al. [14, 15] our techniques apply only 
when jobs are malleable [14, 6, 10, 22] i.e. extra capacity/resources can be used 
to reduce the processing time of jobs. Two such problems are the parallel jobs 
problem [6, 10, 22] and the vector scheduling problem [13, 7, 4]. Using our tech- 
niques we can obtain non-polynomial-time online algorithms for minimizing the 
total flow time for these problems; a detailed discussion of polynomial time online 
algorithms that use resource augmentation to obtain optimum flow time for these 
two problems is deferred to the full version of this paper. 

In Section 2 we explain the models that we use. Section 3 contains the main 
technical contributions of the paper - the lower and upper bounds on the perfor- 
mance of online algorithms using the total flow time and max-stretch metrics. In 
Section 4 we describe online algorithms for the ftp problem using the makespan 
and total completion time metrics. Not all online algorithms in Sections 3 and 4 run 
in polynomial time; polynomial time online algorithms and offline approximation 
algorithms are discussed in Section 5. Section 5 also sketches an offline, polyno- 
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mial time algorithm for minimizing the makespan for the set scheduling problem 
(and hence the online ftp problem) if the rate at which a request is serviced is 
allowed to vary arbitrarily. 

2 Models and Definitions 

In the set scheduling problem there are n jobs and m resources. Job j has an arrival 
time (release date) aj , a processing time pj , and a resource requirement Sj where 
Sj is a subset of S, the set of resources. We define P = max, pj/minj pj. The 
quantity P plays a crucial role in the analysis of our algorithms. As in traditional 
scheduling, both the preemptive and non-preemptive variants are of interest. The 
Set Scheduling Problem can be formulated as either an offline or an online problem. 
As in job shop scheduling and multiprocessor scheduling, the performance of an 
algorithm for this problem can be studied under several different metrics - most 
notably makespan, total completion time, total flow time, and max-stretch. In this 
paper we will concentrate mainly on online algorithms. 

The online ftp problem is defined as follows. We are given a network G = 
(V, E) where all edges have identical bandwidths. Assume that the transmission 
delay along any link is zero, and that there are no buffers in the network. Once 
a source starts transmitting data to another node, the other node starts receiving it 
immediately. Of course the rate at which the sender transmits the data is bounded 
by the minimum available bandwidth along the route over which the transmission 
is taking place. Let m be the number of links in the network, and n the number of 
ftp requests. Request j has an arrival time aj , specifies file size pj , and a route Rj 
over which the data needs to be transmitted. We also address the case where instead 
of the route, the request specifies only the source and the sink nodes. The former 
model is closer to the IP world, where the routes are determined by an external 
algorithm, while the second model is closer to the ATM world, where one can use 
source routing. 

Let Cj be the completion time of job j in a schedule. The quantity Fj = Cj —aj 
is called the flow time of job j. The makespan of a schedule is max, Cf, total 
completion time is ^ - Cj\ total flow time is ^ . Fj and max-stretch is max, Fj/xj 
where t, is the time it would take to satisfy job j if it had the whole network 
to itself. We also permit jobs to have weights Wj. In the presence of weights 
the total completion time and total flow time metrics are defined as J^j w j Cj m & 
^ • Wj Fj respectively. Traditionally, the total flow time and max-stretch metrics 
are considered to be the hardest. These are also perhaps the most interesting metrics 
as they most directly measure end user experience. 
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The following theorem captures the hardness of the set scheduling problem - 
the reduction is straightforward and we omit the details. 

Theorem 1 The Vertex Color problem reduces (via polynomial time reductions) to 
Set Scheduling in an approximation preserving fashion. 

For the vertex color problem lower bounds are known for both the approximation 
ratio (£2(ra 1_e ) unless P=NP [9]) and competitive ratio (£2(ra 1//3 ) [3]). The above 
reduction also holds for the online ftp problem if the routes as well as the transmis- 
sion rates are given as input. Thus to make progress with the set scheduling/online 
ftp problems, we need to relax the model. The first relaxation we propose is to al- 
low rate control for jobs. Thus each job would be assigned a start time sj (sj > aj) 
and a rate r, by the scheduler. The job would execute ^from time Sj to s } - + Pj/rj 
and would consume an rj fraction of each resource in its resource set Sj during 
this interval. More than one jobs may use a resource at the same time. However, 
the total usage of a resource at any time must be at most 1. This relaxation is par- 
ticularly appropriate to the ftp problem: it is possible to control the rate of a TCP 
connection; more than one connections can use the same link; further, a connection 
uses up the same bandwidth on each link along its route 3 . 

3 Flow time and max-stretch using resource augmenta- 
tion 

3.1 Upper bounds with resource augmentation but no preemption 

Assume that all links have the same capacity in the original network; rescale ca- 
pacities so that this capacity becomes 1 . Further rescale time such that the smallest 
request takes four units of time to finish if it has the entire network to itself. Then 
the time required to service the largest request (if the request has the entire network 
to itself) is 4P. 

Let n be the number of requests, and m the number of links. Let K = 3 + 
logrc + log P. We assume that the online algorithm can use a capacity of 5 A" on 
each link. 

3 Instead of allowing a fixed rate rj for each job, we could also allow the rate to vary. It turns out 
that our online algorithms, even though they use just one rate rj for job j, are competitive against 
optimal solutions which are allowed to vary the rate. For offline algorithms it may help to vary the 
rate; we will delve into this a little in Section 5 
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The online algorithm pretends that there are K copies of the network, Go - - - G k - 1 , 
each with edge capacities 5. We call this algorithm MRHP (Most Recent Highest 
Priority) since at any given time, connections which have been waiting in the sys- 
tem the shortest are the most likely to get scheduled. The online algorithm does its 
processing only at integral time instants. Figure 1 describes the behavior of MRHP 
at time t such that t = 2 k .t', where t' is odd. 



for i = 


0 to min{&, K - 1} 


1 . 


Let Sf be the set of requests which arrived in 




the interval [t—2',t) 


2 . 


Find the largest weight subset of 5; that can 




be completed in the network G; between times t 




and t +2' 


3. 


Schedule this subset in Gi such that each re- 




quest has starting time t, finishing time t + 2', 




and a uniform rate during this interval. 



Figure 1: Algorithm MRHP at time t = 2 k .t', where t' is odd. 

The same job may get scheduled by multiple copies of the network. The flow 
time of such a job is taken to be the smallest flow time from all its copies. All 
the jobs ultimately get scheduled by the online algorithm, as the Gk-i alone has 
sufficient capacity to schedule all the jobs. 

Let Wj be the weight of the jth job, and Fj the total time this job spends in 
the system. Let Qk be the total weight of the requests which get scheduled in at 
least one of the networks Gq . . .Gk- Let Q* k be the total weight of all requests that 
have a flow time of at most 2 k+2 in the optimum solution. Let q k = Qk — Qk-i, 
and ql = Q% — Q\_\ (for convenience define Q-i and Q*_ x to be 0.). Each job j 
which contributes to q\ must have a flow time Fj < 2 k+l in the MRHP schedule, 
and each job j which contributes to q£ must have a flow time F* > 2 k+l in the 
optimum schedule. 

Let T* and T denote the total weighted flow times of the optimum and on- 
line algorithms, respectively. Clearly, T* > J2o<k<K ^ k+l 1k- Further, T < 
J2o<k<K-i 2 k+l qic- 

Lemma 2 Q k > Q\ 
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Proof: Let S£ be the set of requests which contribute to Q\. By definition, each 
of these requests has a flow time of at most 2 k+2 . Divide time into intervals of the 
form [i -2 k , (i + 1) • 2 k ) for i > 0. Let Sjf denote the set of requests from S£ which 
arrive during the i-th interval, and let denote their combined weight. All these 
jobs are scheduled by the optimum algorithm to finish before time (i + 1) -2* +2 k+1 . 
Hence all these jobs must arrive and finish in the interval [i ■ 2 k , (i + 1) • 2 k + 2 k+2 ), 
which has length 5 • 2 k . Since Gk has 5 times the original capacity on each edge, 
and since it has all the jobs in available for scheduling during the interval 
[(/ + 1) • 2 k , (i + 2) • 2 k ), it will schedule jobs with a weight of at least during 
this interval. Summing up over all i, Qt > Q\. ■ 

Let g be any function from 31 + to 3l + . Let T* denote the optimum value of 
J2j w jS(Pj) that can be obtained in an unaugmented network, and T s denote the 
corresponding value obtained by MRHP. 

Theorem 3 !F g < J 7 *, for all non-decreasing functions g. 

Proof: Let W = J2j ™j- We define p ( k ) = Qk/W and P*(k) = Q* k /W. P and 
P* are probability measures, and Lemma 2 implies that P* stochastically domi- 
nates P. Theorem 3 now follows from the properties of stochastic dominance - we 
omit the details from this version. ■ 

Theorem 3 is particularly interesting because it shows that MRHP simultane- 
ously optimizes a very wide class of metrics. In particular, the following results 
can be obtained as corollaries. 

Corollary 3.1 T < J 7 *. 

Proof: Let g be the identity function in the statement of Theorem 3 ■ 

Corollary 3.2 Let Fmax denote the maximum flow time ( max-flow ) in the schedule 
obtained by MRHP and F^ AX denote the max-flow in the optimum schedule. Then 
Fmax < F^ AX 

Proof: For p > 0, define T p to be ^ • wj(Fj) p . T* is defined analogously. Theo- 
rem 3 implies that T p < T* for all p > 0. Fmax and F MAX are the limiting values 
of {T p ) xlp and (J r *) l/p respectively as p -> oo. Therefore Fmax < F MAX . ■ 

The average stretch of a job can be mimicked using a total weighted flow time 
objective function with appropriate weights. MRHP does not really need to know 
K in advance - it can maintain an estimate of K and increment this estimate by 
one if and when the current value of K does not suffice to schedule all the requests. 
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Let F^ AX be the optimum max-flow for the given sequence of jobs, given that the 
shortest job takes one unit time to finish if it has the entire network to itself. Notice 
that F^ AX < nP. The following theorem gives a sharper bound on the amount of 
resource augmentation needed by MRHP. 

Theorem 4 MRHP needs 0(log F^ AX ) resource augmentation. Further, F^ AX 
need not be known in advance. 

The above theorem represents a significant improvement, since n can be arbitrarily 
large even in a well behaved system with small max-flow. Section 5 shows how 
to implement the algorithm in expected polynomial time with O (log n + log P + 
logm) resource augmentation. 

We now return to the max-flow metric introduced in Corollary 3.2. The max- 
flow metric (Fmax ) is interesting primarily because it relates to the max-stretch 
metric. We give a simple online algorithm MMF (Minimum Max-Flow) that uses 
only a constant factor resource augmentation. More specifically, MMF uses at most 
five times the capacity of the original network. MMF assumes that the optimum 
max-flow is at least T and at most 2T (Initially, T is assumed to be the time re- 
quired to complete the very first job in the original network.). At times t which are 
multiples of T/2, MMF looks at all requests which arrived during the last T/2 time 
units. It then assigns to each of these jobs a rate which is just sufficient for this job 
to finish in the next T/2 time units. If the load on any edge exceeds five times the 
capacity of that edge in the original network, MMF doubles T, aborts the current 
phase, and waits till the current time becomes a multiple of the new value of T/2. 
The following theorem subsumes Corollary 3.2. 

Theorem 5 The maximum flow time of a job in the schedule produced by MMF is 
no larger than the optimum max-flow. MMF runs in time polynomial in n, m, and 
log P. 

We are now ready to present MMS (Minimum Max-stretch) which uses O (log P) 
resource augmentation and guarantees a max-stretch that is no worse than the op- 
timum max-stretch. We first observe that MMF can be modified to guarantee a 
max-flow that is at most half the optimum value if the amount of capacity on each 
edge is ten times that in the original network. Let p\ be the amount of data transfer 
required by the first job. MMS bunches incoming requests into (at most log P) 
classes, with class i containing all requests which have a data requirement in the 
range [pi -2 l , p\ -2 ,+1 ) (i may be negative as well). There can be at most 2 + log P 
classes. For requests within class i MMS invokes a separate copy of modified 
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MMF. Thus the resource augmentation needed by MMS is 0(logP). Note that 
MMS does not need to know P in advance. The fact that the max-flow obtained 
within each class is at most half the optimum max-flow for that class is sufficient 
to guarantee that the max-stretch obtained by MMS is no more than the optimum 
max-stretch. The following theorem summarizes the claims made in the above 
discussion. 

Theorem 6 MMS uses O (log P) resource augmentation and obtains a max-stretch 
that is no more than the optimum max-stretch. Further, MMS does not need to know 
P. MMS runs in time polynomial in n, m, and log P. 

Note that neither MRHP, nor MMF, nor MMS need to get the transmission 
routes Rj as input. 

Theorem 7 MRHP, MMF, and MMS can obtain optimum values for their respec- 
tive metrics even if the routes Rj are not given as input. 

If routes are not provided as input, MRHP, MMF, and MMS as described above 
would not run in polynomial time. See theorem 15 for the amount of resource 
augmentation needed by polynomial time algorithms. 

3.2 Lower bounds with preemption but without resource augmenta- 
tion 

We show that without extra capacity, the competitive ratio of any randomized on- 
line algorithm which tries to minimize the total flow time (max-stretch, resp.) for 
the data transfer problem against an oblivious adversary can not be bounded by any 
function of the network size. The lower bound for the competitive ratio in terms of 
the number of jobs, n, is £1 (y/n) for both metrics. The quantity P is 1 for the flow- 
time lower bound, and ^/n for the max-stretch lower bound. The lower bounds 
hold even if the online algorithm is allowed to do preemption and use fractional 
capacities on links but the adversary is not. 

Total flow time: Consider the length-3 path A — B — C — D. Assume that all 3 
links have the same bandwidth, u. Each connection will request the same amount 
of data, r. We rescale time so that u = r i.e. each request can be serviced in exactly 
one time unit. 

The adversary first tosses an unbiased coin. If the outcome is "Heads" it chooses 
the link A — B as a special link, else it chooses C — D. During the first time step, 
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the adversary generates k requests from A to C and k from B to D. The adversary 
does not do anything for the next k — 1 time units. Then for the next k 2 time units 
the adversary generates one request per time unit over the special link. 

Lemma 8 The expected flow time of any online algorithm on this sequence must 
be £l(k?), even if preemption is allowed and the online algorithm is allowed to use 
fractional capacities. Further, the optimum flow time for this sequence is 0(k 2 ) 
even without using fractional capacities and preemption. 

Since the number of jobs is n = 2k + k 2 , the competitive ratio of any online 
algorithm must be Q. (s/n) which does not depend on the network size. 

Max-stretch: Consider again the same length-3 path A — B — C — D, with each 
link capacity being u. Again, the adversary first tosses an unbiased coin. If the 
outcome is "Heads" it chooses the link A — B as a special link, else it chooses 
C — D. During the first time step, the adversary generates 1 request from A to C 
and 1 from B to D, each of size ku; for the next k—l time units the adversary does 
nothing. Over the next k 2 time units the adversary generates one request of size u 
every time unit over the special link. 

Lemma 9 The expected max-stretch of any online algorithm on this sequence must 
be £2(k), even if preemption is allowed and the online algorithm is allowed to use 
fractional capacities. Further, the optimum max-stretch for this sequence is 2 even 
without using fractional capacities and preemption. 

The ratio P = p m ax/Pmin for this sequence is k. Since the number of jobs is 
n = 2 + k 2 , the competitive ratio of any online algorithm must be Q (min{P, s/n}) 
which does not depend on the network size. A lower bound of Q.(P 1 / 3 ) for the 
competitive ratio of an online algorithm for the minimum max-stretch problem in 
the context of machine scheduling was presented in [5]. 

3.3 Lower bounds on the amount of resource augmentation 

In this section we give lower bounds on the amount of resource augmentation 
needed for any randomized online algorithm to achieve a constant competitive ra- 
tio. These lower bounds require an adaptive adversary, and assume that the online 
algorithm is not allowed to preempt requests or change the rate at which a request is 
being serviced. Notice that our upper bounds all work against adaptive adversaries, 
and do not preempt requests. 
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Theorem 10 Against an adaptive adversary, any randomized online algorithm 

that achieves constant competitiveness for max-stretch must use £2 (minfra, log P / log log P)) 

resource augmentation. 

Proof: The adversary uses a one link network with capacity 1. Let u be the re- 
source augmentation that the online algorithm uses and let k be a parameter chosen 
suitably below. The sequence of requests created by the adversary consists of sub- 
sequences Aq, A\, . . . , Af, for some / > 0. The beginning of a new subsequence 
A; is called a restart. Initially i = 0. Each subsequence A,- consists of requests of 
size Lj, one every L ; time units where = (I6uk) 3u ~ l . Define a i-phase to be a 
time interval between the ith and the /+lst restart during which no new jobs of A; 
arrive and no old jobs of A, are completed by the online algorithm. Since the algo- 
rithm is not allowed to vary the rates, the adversary can determine at the beginning 
of an / -phase how long the /-phase would last if no new job arrived. The adversary 
also knows the bandwidth utilization of the online algorithm during the /-phase. If 
the adversary encounters an /-phase that would last at least L,/(8m) time units and 
were jobs of A,- use more than 1/3 units of bandwidth, the adversary increments / 
and it restarts. If the adversary does not encounter such an /-phase, it stops when 
A, consists of k jobs. 

Note that whenever the adversary restarts, the bandwidth available to the online 
algorithm for jobs created after the restart is reduced by at least 1/3. Thus the 
adversary restarts at most 3u times, i.e. / < 3u. It can be shown inductively that 
the optimum algorithm can schedule all jobs in U;>, A/ (ie all jobs of size less than 
Li) in time at most L,-. Hence delaying the last job of each size by its size gives an 
algorithm with max-stretch at most 2. 

We show next that the max-stretch of the online algorithm is at least k. Let 
Lf = {\6uk) 3u ~l be the size of the shortest jobs generated by the adversary. 
When the adversary creates jobs of size Lf no /-phase exists of length at least 
Lf/(&u) where jobs of Af use more than 1/3 units of bandwidth. Since k jobs of 
size Lf are created, there are at most 2k /-phases. The total amount of data of jobs 
in Ay transferred during /-phases where the jobs in Ay use more than 1/3 units of 
bandwidth is at most 2k ■ Lf/(Su) ■ u = Lfk/4. We consider next /-phases where 
the jobs in A/ use at most 1/3 units of bandwidth. During the first 2k Lf time units 
of these /-phases at most 2kLf/3 data of jobs in Ay is transferred. Therefore the 
total amount of data of jobs in A / transferred by the online algorithm during the 
first 2k Lf time units since the last restart is at most llkLf/12. Hence, there are 
some jobs of Af left unfinished at time 2k Lf and therefore, there must be some job 
with a stretch of k. 
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It follows that the competitive ratio is at least k/2. Note that the ratio P of the 
maximum job size to minimum job size is (16w&y and that the number n of jobs 
is at most fk. Since / < 3u, n < 3ku and P < (I6uk) 3u . If the competitive ratio 
is a constant, both n/3u and P 1 ^ 3 "^ /(\6u) must be a constant. The first condition 
translates to u = Q(n) and the second translates to u = £2 (log P /log log P). 
Therefore u = £2(min(ra, log P / log log P)). 



Theorem 11 Let y = min{«, P}. Against an adaptive adversary, any randomized 
online algorithm that achieves constant competitiveness for Total Flow Time must 
use Q (-y/log y I log log y) resource augmentation. 

The proof of Theorem 1 1 uses an argument similar to the proof of the previous 
theorem, and is omitted from this version. 

4 Online Algorithms for Makespan and Total Completion 
Time 

Standard techniques can be used to obtain constant competitive online algorithms 
for makespan and average completion time for the online ftp problem without the 
use of resource augmentation. 

Makespan: Define A as the maximum over all edges, e, of the amount of data 
that needs to be transferred over e. We rescale time so that one unit of data can 
be transferred over a link in one unit of time. Let aMAX be the time at which the 
last request arrives. Let L be the quantity max(«MAX, A). L is a lower bound on 
the makespan of any schedule. The online algorithm maintains a guess A for the 
value of L. We assume that the first request arrives at time 0. The initial value of 
A. is set to p\, the amount of data transfer needed by the first request. Each time a 
request arrives, the algorithm recomputes L. If L > A, A is reset to max(L, 2A). 
The online algorithm schedules the newly arrived request to execute from time A 
to 2k, with a rate of 1/A. It is easy to see that the above algorithm does not violate 
capacity constraints. Let U represent the final value of A; by construction U is at 
most 2L. The makespan is at most 2U + U + U/2 + . . . < 4U. We can now claim 
the following result. 

Theorem 12 The above algorithm is ^-competitive. 

The above algorithm runs in polynomial time and hence, is also an offline ap- 
proximation algorithm. However, an offline algorithm "knows" the exact value of 
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L and hence can provide an approximation guarantee of 2. If routes are not given 
as part of the input, a slight variant of the above online algorithm can still obtain 
an 8 -approximation, but it would not run in polynomial time. 

Total Completion Time: The general scaling technique outlined by Hall et al. [15, 
14] directly results in a 4-competitive online algorithm for the total completion time 
metric, regardless of whether routes are given as part of the input. Their technique 
requires an offline algorithm that can pack an optimum number of requests into a 
given interval. This problem is NP hard, and therefore, our online algorithm does 
not run in polynomial time. An O (log m) -competitive polynomial time algorithm 
is outlined in Section 5. 



5 Polynomial Time Approximation and Online Algorithms 

In this section we give offline algorithms for total completion time, makespan, total 
flow time, average stretch, maximum flow time, and maximum stretch that run in 
polynomial time. The algorithms for total completion time and makespan approx- 
imate the optimum performance without resource augmentation. The algorithms 
for the remaining metrics achieve optimum performance using either a constant- 
factor or a polylogarithmic-factor resource augmentation. We conclude the section 
by giving polynomial-time algorithms with optimum makespan under two different 
relaxations of our model: (1) We relax the condition that the rate of a job has to be 
constant: we give a polynomial-time algorithm that varies the rates and achieves 
optimum makespan. (2) We assume that the start time sj is part of the input and 
show that then the problem can be solved in polynomial time. 

Theorem 13 There exists an algorithm that achieves an 0(log m) -approximation 
of the total completion time for the online ftp problem in time polynomial in n and 
m, regardless of whether routes are given as part of the input. 

Proof: Consider the problem of maximizing the number of ftp requests (out of a 
given set of requests, all of which have the same arrival time) that can be scheduled 
over a given period of time. An O(logm) approximation to this problem can be 
obtained using multicommodity flow followed by randomized rounding [21]; plug- 
ging this into the general technique of Hall et al. [15, 14] results in an O logm)- 
competitive polynomial time online algorithm for the total completion time of ftp 
requests. ■ 

A polynomial time 2-approximation for the makespan of the ftp problem when 
routes are given as part of the input follows from the discussion in Section 4; a 
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simple randomized rounding trick results in an 0(log m)-approximation if routes 
are not provided as input. 



Theorem 14 There exists an algorithm that achieves a 2-approximationfor makespan 
in time polynomial in n and m if routes are given as part of the input, and O (log m) 
otherwise.. 

We now describe how to implement algorithm MRHP in polynomial time. The 
only step of MRHP which might take super-polynomial time is step 2, finding the 
largest weight subset A; of 5 ; - that can be completed between times t and t + 2 l . 
To implement it in expected polynomial time we need to add log m + 2eK to the 
capacity of each edge, where K = logra + log P + 3. 

We use first a linear programming relaxation of the problem, then round it prob- 
abilistically and finally show that with high probability no edge capacity constraint 
is violated. The linear program uses for each job j a variable xj and maximizes 
X!|eS; w j x j un der the constraint that for each edge e, ^ uses e XjPj/2' < 1 and 
that for each j, xj > 0. Let x* denote the value of Xj in the solution. We 
probabilistically round each job j for each network i such P(j e A,) = x*. 
Let X e be the random variable denoting the load of edge e in G. The expected 
value \x of X e is Z~2o<i<:K Z~2j uses e x jPj/2' < K- Using Chernoff bounds with 
8 = (log m + 2eK)/n — 1 shows that 

Pr(X e > logm+2^) < ( < (- __)log«+^ < \/( m (nP) 2e ). 

(1 + 8) l+d logm + 2eK 

Thus, the probability that one of the edge capacities overflows is at most \/{nP) 2e 
in which case we simply redo the rounding step. 

Note that both MMF and MMS already run in time polynomial inn, m, and 
log P if routes are given as part of the input. The same ideas that we outlined above 
for total flow time also result in polynomial time algorithms for the max-flow and 
max-stretch problems when routes are not given as input. 

Theorem 15 There exist (online or offline) algorithms that run in time polynomial 
in n, m, and log P and 

• achieve optimum total flow -time or average stretch with an expected O (log n+ 
log P + log m) -factor resource augmentation regardless of whether routes 
are given as part of the input; 

• achieve optimum maximum flow time with a constant-factor resource aug- 
mentation if the routes are given as part of the input, and expected O (log n + 
log m) resource augmentation otherwise; 
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• achieve optimum maximum stretch with an 0(log P)-f actor resource aug- 
mentation if the routes are given as part of the input, and expected O (log n + 
log m + log P) resource augmentation otherwise. 

We finally relax some of our conditions. Consider first the case that the rate of 
jobs can vary. Assume that the optimum makespan is M. We present a linear pro- 
gram that given M checks whether there exists a feasible solution. By performing 
a binary search over M, with 0 < M < nP and assuming that time is rescaled so 
that the shortest job takes one time unit, we get a polynomial time algorithm that 
finds the optimum makespan. 

We assume wlog that the first job arrives at time 0. Break the time from 0 to 
M into intervals whenever a new job arrives and number the time intervals from 
1 to n. Let lj be the length of interval i and let aMAX be the arrival time of the 
last job. Note that the length of the last interval is M — a max- There is a variable 
Xjj for each interval i and each job j . The linear program checks whether there 
is a non-negative assignment for the variables Xjj such that (1) for each job j, 
J2i Xjjk > pj, (2) for each edge e and interval i, J2j uses e x Ui < 1, and (3) for 
each job j and interval i such that j arrived after i, xjj = 0. 

The linear program can be slightly modified to give a polynomial-time algo- 
rithm in the case that the start time sj is given for each job j. 
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